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PREFACE 

The Agriculture and Resources Inventory Surveys Through Aerospace Reinote 
Sensing Is a 6«year program of research, development, evaluation, and 
application of aerospace remote sensing for agricultural resources, which 
began 1n fiscal year 1980, This program Is a cooperative effort of the 
National Aeronautics and Space Administration, the U.S, Agency for 
International Development, and the U.S, Departments of Agriculture, Commerce 
and the Interior. 

The work which Is the subject of this document was performed within the Earth 
Resources ( Research/ Appl lea tions) Division, Space and Life Sciences 
Directorate, at the Lyndon B. Johnson Space Center, National Aeronautics and 
Space Administration. Under Contract NAS 9-15800, personnel of Lockheed 
Engineering and Management Services Company, Inc., performed the tasks which 
contributed to the completion of this research. 
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1. INTRODUCTION 


This implewentation plan proviHas for the development of sampling and estima- 
tion technology supported by the Foreign Commodity Production Forecasting 
(FCPF) project of the Agriculture and Resources Inventory Surveys Through 
Aerospace Remote Sensing (AgRISTARS) program. 

The purpose of this plan Is to specify task objectives, descriptions, scope, 
data and resources, softn^are requirements, and schedules for the fiscal year 
(FY) 1980-81 AgRISTARS^'Supported activities In the areas of multlcrop sampling 
frame development, allocation, segment selection, aggregation, and variance 
est nation which are needed to support both the short term (FYIRRO-Rl) and 
long term objectives of the FCPF project of the AgRISTARS program. Those 
tasks required to support the Integration of the overall procedures Into a 
viable sampling and aggregation methodology are also Included. 

1.1 BACKGROUND 

The FCPF project of the AgRISTARS program Is designed to develop the capabil- 
ity to p»'ocess large data sets In conjunction with objective Information 
systems for performing multicrop Inventorying using remote sensing technology. 
A rigorous development of a sampling and estimation technology Is a critical 
component of the project. 

In supporting the AgRISTARS/FCPF goals, the general sampling and aggregation 
approach Is to continuously Improve upon the existing methodology In order to 
achieve an efficient sampling strategy and reliable crop area and production 
forecasts. 

Substantial effort went Into developing an efficient sampling and aggregation 
strategy for crop acreage and production estimation for a single crop In the 
Large Area Crop Inventory Experiment (LACIC). The extension of the methodol- 
ogy to tv« or more crops occurred during the period before AgRISTARS. The 
tasks described 1n this sampling plan consist of the tasks to be ^^ddressed 
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prliBarny in FY1980-81 In support of the AgRISTARS objectives listed in 
section U2. These tasks (I through 25) are more fully described 
in appendix A. 

1.2 OB^iECTIVES, APPROACHES. AND SCOPE 

1.2.1 OVERALL SAMPLING AND AGGREGATION OWECTIVES AND APPROACH IN AgRISTARS 

For AgRISTARS, the overall objective of the sampling and aggregation proced- 
ures development program is to advance the multi crop sampling and aggregation 
approach to a level ready for inclusion in a future operational commodity pro- 
duction forecasting system. Specifically, it will support objective, timely, 
and reliable crop production forecasts at selected periods during the growing 
season for a range of crops in various countries. 

The general approach requires that each crop region be partitioned into rela- 
tively homogenous subregions (called strata) within which samples of Landsat 
data are selected and machine-processed to identify and measure the areal 
extent by crop type of the crops of interest. 

Further candidate improvements and integrations to be investigated in 
FY1900-81 for achieving a more efficient sampling and aggregation approach are 
detailed in this plan. 

1.2.2 SPECIFIC FY1980-81 OBJECTIVES AND APPROACH 

The specific objectives of the FY1980-81 tasks are to continue developments 
and refinements of various multi crop sampling and aggregation activities init- 
iated during FY1978-79. Emphasis for the FY1980-81 tasks will be on further 
refinements to the grouped optimal aggregation technique, improved approaches 
for estimating within-stratum variance (with more reliance possibly upon 
machine-derived segment-level proportion estimates as opposed to reliance on 
poor historical estimates and/or analyst estimates), more intensive investi- 
gation of alternative sampling strategies such as a full-frame sampling 
strategy, and further developments in regard to a simulated approach for 
assessing the performance of the overall -designed sampling and aggregation 
system. 


specific Usks to be addressed include: 

a. Further investigation of the optirwl sampling unit size with eniphasis on 
determining its relationship to the claissification error variance fiPrimary 
emphasis will be the integration of findings obtained at the University of 
California at Berkeley (UCB) with further In-house investigations to 
arrive at a procedure for determining the appropriate sampling unit size.] 

b. Additional developments and refinements to the grouped optimal aggregation 
technique with emphasis on improving the grouping logic and the approach 
to estimate the various input parameters such as the variance and 
covariance matrices of the historical acreage 

c. Improvements to a priori estimates of wi thi n-stratum variances and to pro- 
cedures for development of sampling frames (The utility of an automated 
procedure will be investigated for providing improvements in each of these 
areas. The capability to create crop proportion estimates at the sampi;,<g 
unit level wil'^ be investigated. This capability needs to be rapid but 
sufficiently accurate so as to improve upon the reliance of historical 
data for use in the wi thi n-stratum variance estimation.) 

d. Investigations of a full -frame sampling strategy and of a multiyear esti- 
mation for improvements in multicrop inventorying (This activity will 
require interfacing with other support contractors who provide support to 
integration of such techniques into an overall -designed system.) 

e. Determination of the feasibility of refining existing stratification 
procedures via more automated (hence, more rapid) approaches for analyzing 
Landsat data 

1.3 SCOPE 

The general scope of the FY1980-87 AgRISTARS sampling and aggregation activi- 
ties involves many different crops including wheat, barley, rice, corn, soy- 
beans, cotton, sorghum* and sunflowers over selected regions within the United 
States, Canada, India, Australia, U.S.S.R., Argentina, ar-'i Brazil. The scope 
for FY1980-8X entails primarily wheat, oarley, corn, and soybeans over 



portions of the yardstick [the U.S. Great Plains (USGP) and the major corn and 
soybean producing states In the United States] In support of deuetopment, 
test, and evalu*^tlon of a sampling and aggregation methodology for application 
In foreign areas. These applied tasks (26 through 82) are described In 
appendix B. The Initial exploratory and pilot testing are to be completed in 
the yardstick region of the United States during the FYig80-81 t1«» frame with 
some Initial exploratory Investigations Into the foreign crop and ;ns 
mentioned above. Also, the sampling and aggregation software r»H,Mtr‘>i'ints In 
support of the U.S. corn and soybeans and the U.S. and Canada spring small 
grains pilot tests will be generated for implementation on the Earth Resources 
System (ERSYS). 

Projected schedules of developmental and applied tasks according to priority, 
time allocation, and manpower needs are given In appendix C. 
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2. HUITICROP AGGREGATION PROCEDURES 


2 a THE GROUPED OPTIMAL AGGREGATION TECHNIQUE 

A modified form of a stratified random sample design Is the basis for the 
sampling strategy for these FY’s of the AgRISTARS program (FY1980-87). 
Unfortunately, large errors In crop acreage and production estimation can be 
caused because of missing data. In fact, It Is possible that all data for a 
stratum may be lost; consequently, a direct acreage estimate of the stratum 
may not exist. To address the problem, a grouped optimal aggregation 
technique has been developed by Dr. A. H. Felveson (ref. 1). Primary 
objectives of the FY1980-61 sampling and aggregation strategy development are 
to continue to Improve (by further refinement, testing, and evaluation) and to 
Implement the weighted aggregation procedure. A brief description of this 
proposed procedure follows. 


2aa CROP ACREAGE ESTIMATION 

Let a. Lx I, be the vector of the unknown current year crop acreages over a 
group of L strata and h a vector of corresponding historical acreages. 

Suppose 

■ vd (1) 


V(2lh) ■= H 

where Y Is an unknown scalar constant of proportionality. 


( 2 ) 


Suppose m of L strata has direct estimates from the sample segment data, 
1 s 1, •••, m; hence, there arc (L - m) strata having no direct estimates. 

Let d_, be the vector of direct estimates made from satellite data 

~inx I 



A 

A 

A 


1 

2 



( 3 ) 
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\ 

where is the true acreage of stratum i and \'s estimated as 

* - 
N ■ n; “1 jS ’’ij 

i * 1, 2, •••» m 


( 4 ) 


and 

A ^ 

A. “ unbiased crop acreage estimate of stratum 1 (i.e», E[A^U^] « with 
the assumptions Var(A^la^) » and Gov(A^, A^i|a^,a^i) ■ 0 for i f i' 

» number of area segments in stratum i 

n^ i= number of segments sampled from stratum i 

cnj agricultural area of a sampling unit (assumed constant) in the i^'^ 
stratum 

estimate of the crop acreage proportion for sample segment j in stratum i 


Suppose a, h, and H be partitioned in the following way. 





(L - m) X 1 


h = 


'<1 

m X 1 

(L - m) X 1 


(B) 


( 6 ) 
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( 7 ) 


«1 

m K 

H4 


\\\ 


m X 


«3 

(L 

H. 


m) 


(L •* ni) X (L - m) 


Now, <:h0 current year crop acreage is supposed to be 

.Aa,. 

A. • t 


E « a* -a 


(a) 


where 


■“r 

1 


■ Si ■ 

• 

IK 


• 

j- 


Oi) 

■^2 


(t - m) X 1 

mm 


( 9 ) 


a coUwin vector of l‘s and A. * the total current year crop acreage in the 
area of interest (L strata) . ‘>'hen, a current year weighted crop acreage 
estimator has the form 


A • 


X‘«d 


( 10 ) 


where 

» the vector obtained according to the following two conditions 
and 


HK “ A,)^ is mini mum 


ill) 


The first two moments of the estimate A, are 

EC\3 * Yh, * Y^‘*|i 

and 


m)'i' is 


(12) 


( 13 ) 
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where 


I » v(jila) 


(14) 


with 


anti 


where 


X *» 



(%i + I'aSg + ”i®i) 

(HjSa + HiSi)]/jjl(| + Hi)-^!j. 


(15) 

(16) 


(17) 


Equations (13 and 14) are the results of the minimization, with respect to X, 
of the mean square error (MSE) * E[(X*ii « In sutenary, the estimate for 

the weighted aggregation procedure current year crop acreage Is equatiou (U). 

I « X‘d 

• is* 


with its expectation 

ECA.] - Yh. . Yg’h 

and Its variance 

V(\) »J,' fx 

where X. jj> )l. fi. and )( are defined In equations 1, 3, 6, 9, 14, and 16, 

respectively. 

The approach for estimating the H nmtrix and for grouping strata to conform 
with equation (U) is discussed in section 2.2 and 2,3. The states and crops 
involved in this investigation are given in Task 2. 



2.1.Z CROP PRODUCTION ESTIMATION 

f^et Tj be the vector of strata crop yields and jj be Its estimate. Then* the 
crop production, P* and Its estimate, P, are as follows. 

P » n*o (18) 

VM rM 

and 

P . s’X’d 

where 

X 1s a matrix of weights obtained so that E[P - P]^ Is minimum 
and 

X‘h, t* h (20) 

For further details on crop production estimation, refer to reference 1. 

2.2 ESTIMATION OF MATRIX H 
2.2.1 BACKGROUND 

The grouped optimal aggregation technique requires Input for the matrix H 
which Is unknown and needr to be estimated. The current estimation approach 
Is outlined below. 

The symbols a and In equation (1) have been used to represent realizations 
of a random vector of crop acreages during 2 different ye?rs. Let a^ be the 
n X 1 vector of crop acreages In year t (t * 1, 2, •••). 
proportionality model In equation (1) can be generalized 

«t * Vt-i * 't 

» ’ 

where the are scalar constants of proportionality and 
Independent random vectors satisfying 

ECe^] *= 0 

and (22) 

E[et4 ] » 


Then, the 
to 

( 21 ) 

the are 
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which are conditional on a^.j^. in particular, one desires to estimate H|- 
where T Is the current year. 


In order to facilitate the estimation of Hy with the present mealier historical 
data supply, a simplified parameteric model Is used for M^, specifically 

- SDUgCa^.i^j, (23) 

where a(._i^( Is the l''*’ elemer'i: of at.i and o is a scalar parameter to be 
estimated. Using equation (23), e can be estimated by 


(N-ll'Cn-l)" S <®t - Vt-l'^ “til <“t ■ Vt-1> 


t»2 


(24) 


where 

T 

. e'a^^ 

y 

(25) 


® Vl 

(26) 

and 

e • (1, 1, •••, 1)^ 

(27) 

Using equation 

(24), the estimate of Hy is constructed by 



H.J. - 0 D1ag(h^^j^^, ••*, h^^^^^) 

(28) 


2.2.2 TASKS 1 AND 2 

The problem follows. If h s ai.i, then equation (28) may be a feasible way of 
estimating H s Hj; however, if h for some k > 1 (I.e., the previous 

k - 1 year of historical data are missing), there needs to be a method of 
adjusting the estimate in equation (28) to account for the degraded accuracy 
of the method in equation (1). 



The tasks for the estimation of matrlK K follow. 

a. Task 1 

The task is to develop an approach for adjusting the estimate in 
equation (28). One approach would be to use the model of equation (1) 
recursively to write afiso) in terms of (sh); •••» end 

"^T-k+l* '**» ’^T* estiiwte the variance of the error term of the new 
model . 

b. Task 2 

The new procedure of Task 1 will be applied to obtain equation (28) by 
using the 1972-76 historical crop statistics for the states of Illinoi'i, 
Indiana, and lowa^, the estimation will be determined for both corn and 
soybeans. An approach will be determined for those areas having perhaps 
only 1 year of historical data available. (This will include the possi- 
bility of using an automated approach for processing appropriate current 
and archived sampling units to support estimation of Hf») 

This same analysis will be conducted for wheat and barley over Korth Dakota 
and Minnesota. 


2.3 FURTHER TESTING OF STRATA GROUPING APPROACH 
2.3.1 BACKGROUND 

For the grouped optimal aggregation technique to be most effective, the strata 
comprising a large region must be grouped so that (a) the model given by equa- 
tion (1) approximately holds within each group and (b) the elements of the 
matrices H and i are about the same magnitude. This grouping approach, which 

IS 

affects the final estimate A, is an important element of this approach. Its 
testing is another FY1980-81 task. This g/ouping approach, in its current 
form, is explained In the following steps. 


a. Step 1 — Within L strata of u crop region where each stratum is named an 
“active group*', one selects an active group with the smallest 


(historical) acreage, say i 
estimate is V(i). 


th 


where the variance of its acreage 



b. Step 2 


c. Step 3 

d. Step 4 


2,3.2 TASKS 3 AND 4 

a. Task 3 

Since APU's ostensibly consist of political subdivisions possessing 
homogeneous agricultural and meteorological characteristics, It has been 
suggested that the grouped optimal aggregation technique will produce 
superior aggregated estimates If the groups used for ratloing are 
restricted to lie within APU’s. The task Is to compare aggregated 
estimates using restricted grouping with those computed using unrestricted 
grouping. 

b. Task 4 

The grouped optimal aggregation technique will be programmed, flowcharted, 
and documented. This modular, structured program will replace the 
existing software written by A. H. Felveson (ref. 1). The new software 
will be an existing subroutine and may be Installed on the Earth 


For each other group, say which Is eligible to be Joined 
with 1 (e.g., adjacent) and has the variance V(J), compute a 
statl Stic 

. . V(1) t V(J) , 


where V(1,J) Is the variance of the combined ^active groups” 1 
and J. Then let 

p s max p 4 (30 

J ^ 

and suppose group Jq gives that max Pj * p. 

J ^ 

— If p < I, make the 1^*^ group Inactive and choose the second 
smallest (historical) acreage active group to restart step 1. 

— If p > 1, replace groups 1 and Jq by a new group which Is the 
combination of 1 and Jq. Then, reactivate all remaining groups 
which are Inactive. This process Is carried out until the final 
grouping Is obtained. 
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Observations Division Laboratory System (EODLS) as a Statistical Analysis 
System (SAS) procedure* The partial response model to be delivered by 
lexas University (TAMU) will be one component the aggregation 
system to be installed on the EODLS. The task is to integrate this model 
into the system. 

2.4 SIMULATION OF AgRlSTARS CROP P ' AODUCTION 

In view of the limited availability of data, a full scale testing of the 

grouped optimal aggregation technique with real survey data is not feasible. 

Consequently, a simulation study will be conducted to evaluate the aggregation 

procedure for its bias and precision. 

2.4.1 OBJECTIVES 

The objectives of the multicrop simulation study are: 

a. To test whether or not the grouped optimal aggregation procedure 1$ 
unbiased and provides efficient estimates 

b. To evaluate the sampling and classification error variance components 

c. To study bias due to nonacquisition of segments 


2.4.2 TASKS 5, 6, 7, AND 8 
a. Task 5 

The simulation process could Involve the foil owl nci four factors. 

1. Crop type — They are corn, soybeans, wheat, and barley. 

2. Error simulation model — This would be exercised at a stratum level 
and a county level. 

3. Error types — Those to be considered are no errors, sampling error 
only, and sampling and classification errors. 

4. Acquisition rates — This would be In percentages of 100, 90, 80, 70, 
and 60. 

Factors 3 and 4 would form 15 configurations as shown in table 2-1. 
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TABLE 2-1.- CONFIGURATIOHS FOR FACTORS 3 AND 4 


Error type 


Acquisition rate^ percent 
ioo 90 80 70 60 


No error 

Sampling error 

Sampling and classification 
error 

Factors 4, 3, and 2 lead to 30 configurations for each crop type. In fact, 
the Input to the simulation study would be obtained from the 1978 corn and 
soybean acreage and yield data for Illinois, Indiana, and Iowa and from the 
wheat and barley for North Dakota and Minnesota. 

Results for the simulation study will be based on 100 aggregation runs in each 
case. For this study, results Include (a) average acreage and production 
estimates for each state as well as for the entire area of Interest, 

(b) estimates of the bias and variance of an acreage or production estimate 
(variance estimate for the MSE of an estimate), and (c) the accuracy goal 
achieved. 

The Monte Carlo technique will be used to generate the Inputs for obtaining 
corn/soybean and wheat/barley acreage and production estimates. Based on 
these aggregation runs, the following will be determined: (a) the bias and 

MSE of an estimate for each state as well a$i for all three or five states, 

(b) contributions of the sampling and classification error components, (c) the 
effect of nonacquTsition of segments on the estimate, and (d) other such 
factors. 

b. Task 6 

An integrated simulation system will be developed and utilized by combin- 
ing the following pieces of software: segment crop proportion estimation 
simulation, a nonresponse simulation, and a yield estimate simulation. 
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c. Task 7 

New simulation software will be developed to facilitate the simulation of 
seoment-level proportion estimates, yield numbers , and nonresponse because 
of cloud cover. The software will be three Independent subroutines and 
may be used 1n any combination to fit many applications. 

d. Task B 

Research will be conducted for the purpose of devel opine more realistic 
nonresponse models (to simulate cloud cover, etc.) than are now 
available. Development of these models will help to answer the 
question; Is significant bias from loss of data due to cloud cover 
Introduced In aggregated acreage and production estimates? 


2.d.3 SIMULATION MODEL FOR SEGMENT CROP PROPORTION ESTIMATION 

The two models considered to simulate segment crop proportion estimates are 
simulation models 1 and 11. The simulation model I Is 


^,1 ' ’’i * * 1,1 * “' 1,1 

where 

P^^ * crop proportion estimate for segment In stratum 1 
P^ a actual crop proportion for stratum 1 

6^^ a sampling error component 

♦ 

a classification error component 
with the following assumptions 

E[6^jl * 0 , V(6|j) * 

. 0 . V(e,j) « 


(31) 


(32) 
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and 


Hern, Is the correlation coefficient of and therefore, 

ECf^j3 - (33) 


and 


V(P,j) . dfg t ofj f s 4 

The simulation model U is 


’’ikj ■ ’’tk * *)kd * 


(34) 


where 

A 

^ikj * proportion estimate for segment j in county k of stratum i 

k « 1 . 2 . •••, K 

« actual crop proportion for county k of stratum i 

* sampling error component at the county level 

* classification error component at the stratum level 
with the following assumptions 

ECe^j] « 0 , V(c,j) > 

and 


( 35 ) 


Cov(S,kj,e,j) » 



Htrt« If the corrtlttlon coefficient of 6^^ end therefore, 

—2 _ I 2 2 I 

’i*t •’’i* • ®tb' 


so 


and 


®l*c ®k 


’ib ■ KT^, *^5.. - 


■ ^k 


V<P,„j) H . of., + of., t Zp,o,.,o,., 


( 36 ) 


(37) 


2.4.4 DATA INPUT AND OUTPUT 

The simulation Input will be based on the 1978 historical crop acreages and 

py78 segment data as follows; 

a. The 1978 crop acreage estimates by the U.S. Department of Agriculture 
(USDA) /Economics and Statistics Service (ESS) are to be used for 
detensiination of the P^ or P^u (i.e., for the simulation model to generate 
the crop proportion for the sample segments), 

b. The initial within-stratw variance estimates are to be used for 

determination of sampling error o? or o? . 

ie 1 »e 

c. Segment crop proportion estlmetes fre?! the 1978 crop year win be compared 
to ground-truth segment proportions, and the estimation errors will be 
used to compute o? ^ and of . 

I*C 1 d® 

d. Computations of the above parameters will be carried out at the refined 
stratum level. 

e. The historical 1972-76 crop statistics will be used for computation of 
matrix H. {This procedure is repeatable in a foreign country provided 
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th«t at least 3 years of historical crop acreage statistics are available; 
where this Is not the case, considerations as Indicated in Task Z will be 
Investigated.) 

f. Tht' strata and higher level aggregated estimates of crop acreage and 
production will be obtained using the simulated inputs. The associated 
variance estimates from the grouped optimal aggregation technique will be 
computed. The bias and repeatability of the proposed estimates will be 
determined by the Monte Carlo technique. 
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3. SAMPLING FRAME DEVELOPMENT 


3J OBJECTIVES AND APPROACH 

The objectives of the sanpllng frame development are (a) to Integrate the 
approach for creating sampling frames as developed by the USDA/ESS Into the 
overall sampling anJ aggregation methodology and (b) to develop and Implement 
an automated procedure In support of creating crop-specific strata* 

The approach consists of (a) the Implementation of the USOA/ESCS software and 
sampling frame data base Into the EODLS at the National Aeronautics and Space 
Administration* Lyndon B. Johnson Space Center (NASA/JSC) and (b) the Imple- 
mentation of the USDA/ESS-developed procedure for the overlaying of varying 
sized sampling units as vwll as the module for selection of the sampling 
units. 

In addition* an automated procedure for making crop proportion estimates at 
the sampling unit level Mill be utilized In developing an automated approach 
for creating crop-specific strata. 

3.2 SAMPLING UNIT SIZE STUDY 
3.2.1 BACKGROUND 

The LACIE sampling unit size of 5 by 6 nautical miles was considered to be 
large enough for the Classification and Mensuration Subsystem (CAMS) analysts 
to obtain wheat acreage er^lmates and small enough not to tax the computer and 
manpower resources. However* It Is time to reexamine the sampling unit size 
for the purpose of obtaining sampling unit sizes of the stratum level that are 
more adequately suited to meeting project requirements. Other significant 
facts which Influence the considered change In the sampling unit size are: 

a. The analysts are more experienced. 

b. The crops of Interest In AgRISTARS are not only wheat or small grains but 
also soybeans, corn, rice, cotton* sorghum, and sunflowers. 


3-1 



c. The areas of Interest Include not only Canada, U.S.S.R., and i USGP 
which Is the major wheat and barley producing area In the U.S* but also 
Australia, India, Argentina, and Brazil. 

d. The sampling efficiency needs to be Improved by reducing the sampling unit 
size, If necessary, since a proper size for a sampling unit can result In 
mure homogeneous sampling units and, hence, may lead to a significantly 
smaller number of sampling units requiring processing. 

Criteria to be considered In deciding upon a sampling unit size are: 

1. The accuracy of labeling and classification of segments should be con^ 
sidered as a function of size. There Is a need to examine the extent 
to which a smaller segment size leads to larger labeling and classifi- 
cation errors; It Is well known that the Interpretation of the Landsat 
Imagery, the estimation of crop signatures, and the registration 
procedure require an adequate segment size. 

2. The overall processing cost with smaller segment sizes is expected to 
Increase In comparison to that of larger segment sizes because a 
larger number of segments will need to be processed to reduce the 
sampling error. 

3.2.2 TASKS 9 AND 10 

A cost analysis study Is being conducted by the UCB to Investigate the effect 
of segment size on the analyst labeling accuracy and on the overall segment 
processing cost. Subsequently, another sampling unit size study Is envisioned 
for Robeson County, North Carolina, where wall’to-wall ground data on crop 
types ere expected to be available from the USDA. 

a. Task 9 

This task will Integrate the results from the two studies (UCB and Robeson 
County) Into a procedure for determining the optimal sampling unit size 
for multicrop estimation. 


4 
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The pufpoie of this task Is to develop and to evaluate the appi^oach for the 
integration of the t^ studies. An evaluation Mill also he made of the cost 
analysis stu4y conducted at UCB. The necessary Inputs Mill be provided on the 
experiment design for the Robeson County study (conducted by USDA/F.SS) of the 
sampling unit size. 

b. Task 10 

This task requires the designing and Implementing of a sampling frame to 
be delivered by USDA/ESS. 

The USDA/ESS Is required to develop a semiautomatic digitization and 
enumeration softMare In support of sampling frame development for transfer to 
HASA/JSC. Upon receipt of the sampling frame softMare from USDA/ESS, an 
In-house version Mill be designed, Implemented, and tested. The design Mill 
provide the capability to permit complete automation of the task of creating 
and overlaying a sampling unit grid alloMing for differing sampling unit 
sizes, plus the actual selection of the sampling units. 

It Is envisioned that the Implementation of the sampling frame Mill have an 
easy-to-use Interface Mith the SAS. A capability for "collapsing" or 
"splitting" the elemental units Mhich make up the sampling frame Mill be 
developed, tested, and Implemented (probably as a SAS procedure or, at least, 
be Interfaces Mith SAS). 



4. MULTICROP SAMPLE ALLOCATIOH 


4.1 OBJECTIVE 

The objective of the multlcrop (stratified) sample allocation task Is to 
determine the sample size n^ (h ■ 1, L; L being the number of strata) so 
that the variances of thst individual crop production estimates satisfy some 
specified precision tolerances with the cost minimized. 

4.2 INITIAL MITHIN-STRATUM VARIANCE ESTIMATES 

Before determining n-y,. one needs to estimate the within-stratum crop variances. 
A procedure for estimating the within-stratum variances (ref. 2) has been 
developed; however, It needs to be tested further for different crops and 
different areas of Interest. 

4.2.1 TASK 11 

This task requires further testing of the proposed within-stratum variance 
estimation procedure for corn and soybeans In Iowa, Indiana, and minols, for 
the data set In Robeson County, North Carolina and for wheat and bariny In 
North Dakota and Minnesota. 

Stratum variances win be obtained from the crop proportion estimates for the 
segments. These variance estimates will then be compared with those 
determined by using the proposed method. The performance of the method will 
be Judged based on this comparison and a statistical test for equality. 

4.2.2 TASK 12 

Task 10 deals with a machine classification procedure. This procedure shows 
potential for Improving Initial variance estlmanon; therefore. It needs to be 
tested by evaluating Its classification accuracy and investigating the 
feasibility of Its Implementation beginning with the automatic segment strip- 
off from the high density tapes to output of within-stratum variances. It may 
not be feasible to perform these tasks this year. 
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4.3 ALLOCATION TO SUPPORT EXPLORATORY STUDIES 


Sample allocation procedures will be determined for corn/soybeans in Brazil 
and Argentina and for spring wheat/barley in the U.S.S.R, (These tasks will 
be performed as applied tasks.) 


5. SEGMENT SELECTION AND LO'JATION 


With the sample allocations completed for the FY1981 exploratory studies, the 
tasks of selecting sample sepients and locating them on the samplinq frame 
win be undertaken. Sampling- frame software to be developed by USDA/ESS Is 
not yet available to select and locate the sample segments In (a) Brazil for 
the oorn/soybean exploratory study, (b) U.S.S.R. for the spring wheat/barley 
exploratory study as planned for FY81, and (c) Argentina for the corn/soybean 
exploratory study. Therefore, these selections will be made manually. 



6, STUDY OF ALTERMATIVE SAMPLING STRATEGIES 


Investlqatlon of some alternative sanplinq strateqles is warranted. In par- 
ticular, the most promising of these Investigations are the multiyear model 
and the full-frame sampling. 

The multiyear models for crop acreage estimation are being developed, tested, 
and evaluated at TAMU and the Environmental Research Institute of Michigan 
(ERIM), whereas consideration for development of a full-frame sampling 
strategy approach has received attention at the Laboratory for Applications of 
Remote Sensing (LARS), UCB, and ERIM. 


6.1 HARTLEY MULTIYEAR MODELS 

6.1.1 HARTLEY MIXED ANALYSIS OF VARIANCE MODEL 

6. 1.1.1 Objectives and Approaches 

The objective Is to Improve the conventional current year crop acreage propor- 
tion estimates. The multiyear models will utilize not only the current year 
sample segments but also those of many successive years In order (a) to 
explore the properties of consistency In segment crop acreages from year to 
year and (b) to deal with the systematic differences between early-season and 
late season estimates. 


A mixed analysis of the variance model will be considered wherein the current 
year direct estimates (from Landsat data) and the previous year estimates will 
be Investigated for providing a more precise current year estimate. 


6. 1.1.2 Description of Hartley Mixed Analysis of Variance Model 
The basic model (ref. 3) 1s 








®tS£ 


(38) 
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where 


s > 1 , 2 , •••, segments studied 

i « It 2, 3 for early season* mldseason* and late season 
t ■ 1 , 2 , •••* years under study 

^tst “ analyst- Interpreter(AI) estimate of the wheat proportion of 
segment s of stratum h in the crop calendar period A of year t, 

y(Ptat) » a mathematical variate transform of (say its logarithm or logit 
transform); y(Ptsjt) will be abbreviated as y^^jj 

* an effect constant of year t (representing for example, the favor- 
able or unfavorable economic outlook for wheat in year t at planting 
time or, ft least, the average transform) 

bg « an effect variable for segment s consistent through all years of the 
data bank (e.g., the soil type effect); it assumes b^ N( 0 ,o^) 

» the consistent (systematic) difference between the late-season esti- 
mate (which corresponds to 63 * 0 ) and the estimate made at crop 
calendar period A (A * 1) for early season, A * 2 for midseason) 

®tsA ** aggregate of sampling and classificati^^n errors in the trans- 
formed AgRISTARS records; it assumed e^^^^ N(O, 0 g) 

The Aitken weighted least squares estimator of e * (o^, •••, •••, 

^3) * i s 

e = X'H'ly (39) 

where 

H » I + yUU‘ 

2 . 2 

Y = OblOe 

X and U are design matrices and y is vector of ytsu's 
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(Note; Each will be utilized only for one value of tp and this will be 
^.he crop-calendar period i for which an acquisition was utilized that 
was not available at the previous forecast .t 1.) 


Equation model (38) can be utilized under three forms of mathematical variate 
transforms y(Ptsl)* 

a. The Identity Transformation 


For the identity transformation y(Ptsi|) - **tst» model will be 


tsi 


Pj + bj + «, + ejj, 


(40) 


where 




A A 

that 1s: Py w oj 


This model will help to analyze the P^. records directly; hence, aggregations 

of wheat acreage estimates aS are needed for strata., strata combinations, and 

^t 


countries. This model will give unbiased (or near unbiased) estimates Pj. If 
the additive equation (38) Is satisfied For the Ptsf 


b. The Logarithmic Transformatlcn 

For the logarithmic transformation y^jj^ = log P^sz* ^be estimate of the 
crop proportion in crop calendar period A « L and current year t = T Is 

P^gj^ * C Expio^ (41) 

where 

C - Expk (o^ + - I Var(i^ + «j) 

“tst ■ - *ts»> 



^tsA “ 

“t* \ unweighted least squares estimates of equation (38) with 

all factors being fixed. 
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c. The Logistic Transformation 


For the logistic transformation 


w - 1 1^^ "^tst 

^ts* 7 ’®3 T-VT 


tst 


A A 


A A 


Ptsl “ C'exp[2((i^ + 6^)3/ {1 + exp[2(o^ + 6^)’}} 


where 


C = 


^[^tsr ^^-^tstl] 


with 


and 


^ V(<x^) + V(«^) + 2 cov(«^. <sp 
= E(exp(2;^^yCl + exp(2y\jj)]) » FCu^s JV(y\j,)] 

Note: The efficiency Is defined by the variance reduction ratio 


R = 


Var(a^) 


Varlyy) 


(44) 

(45) 

(46) 

(47) 

(46) 

(49) 

(50) 


where y^ Is the corresponding estimate of Oj In the current year T based 
only on current year acquisitions and Oy. Is the estimate from the proposed 
multiyear model. 

6. 1.1. 3 Tasks 13 and 14 

The following tasks will be implemented using previous segment estimates for 
wheat and small grains over a 4-year period for North Dakota. 
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a. Task 13 

The requirement is to examine the feasibility of utillzinq the estimate 
qlven by the multiyear models In the vwlqhted aqqrenatlon procedure by 
replacinq the P^^'s In equation (5) with P^jj's Trom the imjltlyear 
model. The combined procedure will he called the Multiyear Welqhted 
Agqreqatlon Procedure (MYWAP). 

b. Task 14 

The requirement Is to Investigate the utilities of the imjltlyear models 
for (a) crop estimation with missing data because of cloud cover or poor 
quality Imageries and (b) early season, midseason, and late season estima- 
tion by estimating R In each cs«e. (R Is the efficiency parameter defined 
above. ) 

6.1.2 ROTATION SAMPLE DESIGNS 

The original (and simplest) model of the multiyear models Is the rotation 
sample design (ref. 4). 

6. 1.2.1 Objective and Approach 

The objective of this rotation sample derlgn Is to provide current year crop 
acreage estimates which are more efficient than the conventional current year 
crop acreage estimates by utilizing the knowledge that the variation of the 
crop acreage of a particular segment from year to year is usually less than 
the variation of the crop acreage of different segments within a particular 
year. 

The approach follows. The current year direct segment average estimates from 
the satellite data and the previous year estimated acreages are utilized to 
provide a more precise current year estimate. 
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6.1.E,E Description of Rotation Sample Designs 
The basic model (ref. 5) is 

*ts * “t ♦ '•s + *ts (“) 

for t *« 1, 2* 3, 4 years (the current year is T i 4) 

s « 1» 2, 3, 4 (s is the segment or sampling unit number) 

where 

« average true wheat acreage per segment in year t (a^'s are fixed year 
constants.) 

bg « true segment variables applicable to all years with the assumption 

bj ~ N(0,c.g) 

e ^5 * composite segiirant error variable of segment s in year t with the 
assumption ~ N(0»u|) 

» current year directly estimated (from satellite data) or previously 
estimated wheat acreage of segment s of stratum h in year t 

This segment s is observed in stratum h* Within the population of stratum h, 
there are allocated ■ S segments for each year t with a condition of S * 1, 
2, 3j or 4 segments; 1*e.» the model is applied to nominal Group I crop 
regions. The allocation of sample segments is followed by a rotation pattern. 
The authors in reference 5 investigated many different rotation patterns in 
using equation (51). The optimal pattern consists of S * 2 segments per year 
which are retained 1 segments (past year to current year). One observed 
segment will return to the sample after a 2-year absence. This rotation 
pattern is presented in figure 6-1 which utilizes four segments in 4 years. 
Figure 6-2 shows the pattern (S » 2, r « 2). 



Vt«r 


Segmnt 

Humber 



Figure 6-1,- (S « r » 3) rotation pattern [2 retained 1; T = 4]. 


Year Number 



Figure 6-2.- (S = 2, r = 2) rotation pattern. 
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QUmmt PAQE IS 
OF POOR QUALITY 



In matrix form, the basic equation (51) of the optimal rotation pattern can be 
written as 


a * Xa + Ulj. + I^ 


where 


U » 



and 

a * (a^, dg, ag, 

Is to be estimated as 
b s (bp bg* bg» b^)' ~ NI^(0» lo^) 

~ ®4P ®12* ®22* ®23* ®33» ®34» ®44^ 

a * (A^p A^p A^g, Agg, Agj, A 33 , Ag^, A^^) 

Then, it is proved that 

a « (X*H“^X)-^ X'H“^a 


(52) 


(53) 


(54) 


(55) 
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which Is the best linear unbiased estimator (BLUE) of ^ 
where 


H » I + m* 

^ A0% Ap 

Y » tcio: 


(56) 

(57) 


with being deduced from and the estimate of the variance of being 


Varli.) • i-lli (1 + -i—, - -2-.) 
’ " 1 + 2r 1 + Y 


(58) 


therefore, the estimate of the stratum h crop acreage at the year T s 4 will 
be 


Nh * 


(59) 


with its variance estimate 


Var(A^^) » Var(a^) 


(60) 


where is the population stratum h size and the current year estimate of the 
total crop production in the crop region of interest will be 


Prod, 


■i 


<*4h^h> 


(6i) 


where is the current year stratum h averi^ge wheat yield estimate. 

6. 1.2.3 Tasks 15 and 16 

These tasks will be implemented using previous estimates for wheat and small 
grains in North Dakota for a 4-year period. 

a. Task 15 

The requirement is to examine the efficiency of the rotaticr* sample design 
on the wheat data available in North Dakota. 
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b. Task 16 

The requirement is to develop an aggregation procedure tailored to the 
rotation sample designs. The combined sampling and aggregation procedures 
will be called "Rotatlon-Prestratlfled Sampling and Aggregation." 

6.2 FULL-FRAME SAMPLING STRATEGY 

6.2.1 OBJECTIVES 

The objective of a full-frame sampling strategy Is to provide an alternate, 
yet more efficient, sampling and estimation approach based on a sampling unit 
size which is as small as one pixel. LARS, ERIM, and UCB will Investigate the 
feasibility of a full-frame sampling strategy, 

6.2.2 TASKS 17 AND 18 

6. 2. 2.1 Task 17 

This task will consist of the Integration of the recommendations resulting 
from the Investigations of a full-frame sampling strategy Into the overall 
sampling and aggregation methodology. A primary part of task 17 Is the 
Implementation of the adapted procedures on ERSYS. 

6. 2. 2. 2 Task 18 

The requirement Is to develop an alternate full-frame sample design. 


f 
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7. SOFTWARE CONVERSION 


Currently, 'nost of the sampling and aggregation software and data bases devel- 
oped in the past reside on the Programmed Data Processor, Model 11/45 (POP- 
11/45) c(«nputer system. Three tasks will be Implemented to consolidate the 
sampling and aggregation capabilities on the LARS syste«n at Purdue University 
and on EOOLS Analysis Subsystem (EAS) when it becomes available. Results of 
the three tasks are to be maintained on the LARS system until the In-house EAS 
becomes available. Then similar conversions will be required for Implementing 
all software and data bases In-house. 

7.1 TASK 19 

The requirement Is to transfer the LACIC and TY sampling and aggi'egatlon soft- 
ware from the POP U/45 computer systan to the LARS systein with control 
configurations specified and procedure requirements documented. 

7.2 TASK 20 

The requiranent is to transfer from the POP 11/45 computer system to the LARS 
system all data bases consisting of agricultural statistics and CAMS segment 
estimates for the supporting USGP and foreign countries. A record of all data 
files thus created on the LARS system will be maintained. 

7.3 TASK 21 

The requirement is to conduct verification tests for the transferred softwares. 



8. ASSEMBLAGE OF DATA BASES 


8.1 TASK 22 

The requirement is to prepare a data base containing CAMS segment estimates 
for early season, midseason, and late season as well as the bHnd-site true 
crop proportions from LACIE Phases II and III. 

8.2 TASK 23 

The requirement is to assemble and file all Crop Assessment Subsystem (CAS) 
reports for LACIE Phases I, II, and III and TY. 
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9. OOCUMENTATIOM 


9.1 TASK 24 

The procedure requirements for the sampling and aggregation software win be 
specified and documented, 

9.2 TASK 25 

The preparation of this Implementation plan is Task 29, 
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DEVELOPMENTAL TASKS; SUMMARIES OF 
BACKGROUND, ACTIONS, TIME, AND MANPOWER 


APPENDIX A 


DEVELOPMENTAL TASKS; SUMMARIES OF BACKGROUND. 

ACTIONS. TIME. AND MANPOWER 

Appendix A consists of the developmental tasks In outline form. These tasks 
were generated for this Implementation Plan. 

The equations In this appendix are taken from the text of this document 
{LEMSC0-1516B) and retain the equation number given In the text. 
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PRECEDING PAGE BLANK NOT FILMED 


TASKS I AND 2: 
BACKGROUND: 


ACTIONS IN TASK 1: 

TIME; 

MANPOWER: 

ACTIONS IN TASK 2: 


TIME: 

MANPOWER: 


ESTIMATION OF MATRIX H 

The crop acreages at the year t which are to be 
should be In the inodel 

*t “ Vt-l ^ h * t ■ l,2,»«»J 

where 

E[e^|«t-l] ‘ ® 

“t* V-l.n) 

Develop a procedure to adjust the estimate In 
equation (28) 

13 weeks 

135 man-weeks 

A 

Actually estimate Hj using the procedure In Task 1 and 
the 1972-76 historical crop statistics for the states of 
Illinois, Indiana, and Iowa. This estimation will be 
done based on corn and soybean data. 

4 weeks 

45 man-weeks (If the soybean data base Is In raw form, 
two (2) more man-weeks will be needed to prepare soybean 
data.) 


estimated 

( 21 ) 

( 22 ) 

(28) 



TASKS 3 AND 4; 
BACKGROUND; 


ACTIONS IN TASK 3; 

TIME; 

MANPOWER; 

ACTIONS IN TASK 4; 


TIME: 

MANPOWER: 


TESTING OF STRATA GROUPING APPROACH 

Since APU's ostensibly consist of political subdivisions 
possessing honwgeneous agricultural and tneteorologlcal 
characteristics, It has been suggested that the weighted 
aggregation procedure will produce superior aggregated 
estimates If the groups used for ratloing are restricted 
to lie within APU's. 

Compare aggregated estimates using restricted grouping 
with those computed using unrestricted grouping. 

16 weeks 

8 man-weeks 

The grouped optimal aggregation technology will be 
programmed, flowcharted, and documented. This modular, 
structured program will replace the existing software as 
written by A. H. Felveson. The new software will be an 
existing subroutine and may be Installed on the EOOLS as 
a SAS procedure. 

April 1, 1981 (date due) 

Contact person Is George Clouette. 



TASKS 5, 6, 7, 
AND 8 

BACKGROUND: 


SIMULATION OF AgKISTARS CROP PRODUCTION 


Because of the limited data available, a full-scale test- 
ing of the grouped optimal aggregation technique Is not 
feasible. Therefore, some simulation processes are con- 
sidered for generating the estimates of the segment pro- 
portion to be used In testing. 

For P^ or P^)^ which represent the actual crop proportion 
for stratum 1 or for county k of stratum 1, the crop pro- 
portion estimate (simulated) for segment In (county 

A A 

k) stratum 1 will be P^j or 
It will be P^j where 

and (33) 

V(P,j) • oL + "ic * 
and 


0 ^^, 1s the variance of sampling error component 5^^ 

9 

Is the variance of classification error component 
and is the correlation coefficient of 6^^ and 

A 

It will be P^j^j where 


- ®1.e * 


and 


(37) 


’l.e • '4 - “?bl 


- 1 to D 

"> '^k=i ' 
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ACTIONS IN TASK 5: Use 1978 crop acreages estimated by USDA/ESS to determine 

the P^'s or Use the Initial within- stratum 

variance estimates to determine the sampling error 
2 2 

(Jle or 0 ^,^. Segment crop proportion estimates will be 

compared to ground-truth segment proportions and the 

2 2 

estimation errors will be used to compute and 

NOTE: Computations of the parameters will be 
calculated at the refined stratum level. The 
Input to the simulation study will be obtained 
from the 1978 corn and soybean acreage and yield 
for Illinois, Indiana, and Iowa. 

The stra^-^ and hig^ ^r level aggregated estimates of crop 
acreage and production will be obtained using the simu- 
lated Inputs. The associated variance estimates from the 
grouped optimal aggregation technique will be computed. 
The bias a.nd repeatability of the proposed estimates will 
be determined by the Monte Carlo process. 

TIME: Current year 

MANPOWER: 35 man-weeks 

ACTIONS IN TASK 6; Develop and utilize an Integrated simulaticyn system by 

combining the following pieces of software. 

a. Segment crop proportion estimation simulation 

b. Yield estimate simulation. 

c. Nonresponse simulation 

TIME; Current year 

MANPOWER; 35 man-weeks 
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ACTIONS IN TASK 7: Develop new iInHilatlon software to facilitate the 

slimilatlon of segment-level proportion estimates, yield 
numbers, and nonresponse because of cloud cover* The 
software will be three Independent subroutines and may be 
used In any combination to fit many applications. 

TIME: April 1, 1981 (due date) 

MANPOWER: Contact person Is George Clouette. 

ACTIONS IN TASK 8 Develop more realistic nonresponse (cloud cover, etc.) 

models than now available. (Development of these 
models will help to answer the question: Is significant 
bias from loss of data due to cloud cover Introduced In 
aggregated acreage and production estimates?) 

TIME: 32 weeks 

MANPOWER: 28 man-weeks 



PRECEDING PAGE BLANK NOT FILMED 


TASKS 9 AND 10; 
BACKGROUND; 

ACTIONS IN TASK 


TIME: 

MANPOWER; 

ACTIONS IN TASK 


DESIGN AND IMPLEMENTATION OF SAMPLING FRAME 

An optimal sampling unit size different from 5 by 6 naut- 
ical miles needs to be determined. The University of 
California at Berkeley Is Investigating the effect of 
segment size on the analyst labeling accuracy and on the 
overall segment processing cost. The USDA/ESS Is study- 
ing another sampling unit size which Is envisioned for 
use in Robeson County, North Carolina. 

9; Integrate the two studies (University of California at 
Berkeley study and the USDA study In Robeson County) and 
help In determining an optimal sampling unit size for the 
multicrop estimation for the areas of application. Then 
evaluate the cost analysis study of the University of 
California at Berkeley and provide the necessary Input on 
the experiment design for the Robeson County study of 
sampling unit size* 

26 weeks 

0.5 man-egulvalents for 6 months for coordination and 
Integration 

10; The USDA/ESS Is required to develop a semiautomatic 
digitization and enumeration software In support of 
sampling frame development for transfer to NASA/JSC. 

Upon ‘Receipt of the sampling frame software from USDA/ 
ESCS, an In-house system will be designed, Implemented, 
and tested. The design will permit complete automation; 
e.g.j the task of creating and overlaying a sampling unit 
grid allowing for differing sampling unit sizes plus the 
actual selection of the sampling units. The Implementa- 
tion of the sampling frame will have an easy-to-use 
Interface with the Statistical Analysis System (SAS). A 
capability for collapsing or splitting the element units 
making up the sampling frame will be developed, tested. 
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TIME: 

MANPOWER; 


and iwplewented (probably as a SAS procedure or, at 
least, be Interfaced with SAS), 

If USDA/ESS delivers as a package and not in bits and 
pieces, then the following will be required. 

a. 13 man-weeks for software requirements 

b. Programming needs will be: 

Design specifications — 8 man-weeks 
Coding— 2 man-equivalents for 9 weeks 
Documentation — 9 man-weeks 

c. Acceptance: 2 man-weeks 



TASKS 11 AND 12: 

i3ac;;grouno; 


ACTIONS IN TASK 


TIME: 

MANPOWER: 
ACTIONS IN TASK 


TIME: 


MULTICROP SAMPLE ALLOCATION 

The objective is to determine the sample size nj, of each 
stratum h, h - 1, 2 , •••» L, so that the variances of the 
individual crop production estimates satisfy some speci- 
fied premium tolerances with cost minimized. Before 
determining np, the within-stratum crop variances needs 
to be estimated. 

11: Initial within-stratum variance estimates — A procedure 

for estimating the within-stratum variances was nroposed 
hy Chhikara and Perry (ref. 2). Testing of the proposed 
within-stra turn-variance estimation procedure wlU be 
Implemented for corn and soybeans In Iowa, Indiana, and 
Illinois. The exploratory study (1979) segments will be 
used for this testing. Strata variances will be obtained 
from the crop proportion estimates for the segments. 

These variance estimates will then be compared with those 
which were determined by using the proposed method. The 
performance of the method will be judged based on this 
comparison and the statistical test developed for testing 
their equality. 

6 weeks 

6 man-weeks 

12: The machine classification procedure will be tested by 

evaluating Its accuracy and by Investigating the feasi- 
bility of its Implementation, beginning with the auto- 
matic segment str1p-off from the high density tapes to 
output of the within-stratum variances. 

8 weeks 


MANPOWER: 


12 man-weeks 
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TASKS 13 AND 14 
BACKGROUND; 


ACTIONS IN TASK 


TIME: 

MANPOWER: 
ACTIONS IN TASK 


HARTLEY MULTIYEAR (LOG) MODEL 

For the three forms of the base model 

ylPtst* • “t ^ * *ts« 

the logarithmic model Is favored to give the estimate of 
the crop proportion for sepents In crop calendar period 
A « L at current year t * T which Is 


P^5L = C Exp{a^ + 6^} (41) 

where 

2 

C = Exp I (o^ + />~) - I Var(a. + 6 ) (42) 

and other notation definitions given In section 6, 1.1. 2, 

13: Examine the feasibility of utilizing the estimate given 

by the multiyear models In the weighted aggregation pro- 

A 

ceduri by replacing P^j's In equation (5) of the 
weighted aggregation procedure with ^tsi ® from the mul- 
tiyear model and by developing an aggregation using the 
proportion estimates for wheat and small grain over 4 
years for North Dakota (adaption of weighted aqgretatlon 
procedure to MYWAP), 

13 weeks 

13 man-weeks 

14: Investigate the utilities of the multiyear model for 

(a) crop estimation with missing data due to cloud cover 
or poor quality Images and (b) early-season, midseason, 
and late-season estimations derived by estimating R 
(the efficiency parameter defined above) In each case. 


TIME: 13 v^eeks 

MANPOWER: 13 man-weeks 


A-14 



TASKS 1*3 ANO 16 •, 
BACKGROi'NO; 


ACTIONS IN TASK 

TIME: 

MANPOWER: 
ACTIONS IN TASK 


TIME: 


original page is 
OF POOR QUALITY 


ROTATION SAMPLE DESIGNS 

The niJltiyear nioHel OR) In its sinolest form is 

Where A^^, o^, b^, and e^^ are defined In sec- 
tion 6. 1.2, 2. 

The results qiven by this rotation sanir'ie desiqr are the 
estimates of the average wheat acreaae oer seoment In 
yean t = T where 


a « (X X H'^a OP) 


with Var(a.) s i-JJl (l + ~~1 (5R^ 

IP: Use postseqment acreaqe estimates for wheat and small 

qralns over 4 years for North Dakota to examine the 
rotation sample desiqn (f1g. 6-1 in section S). 

10 weeks 
10 man-weeks 

16: Develop an aoqreqatlon procedure tailored to the 

rotation sample designs . The combined samp! 1 no and 
aqqrenatlon procedure will be called 
"Rotation-Prestratified Samplinq and Aoqreqatlon," 

(Use the 4 years of available data.) 

10 weeks 


►'ANPDWEP: 


in man-weeks 
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TASKS 17 AND 18; 

FULL-FRA„ME SAMPLING 

BACKGROUND; 

The LARS at Purdue University is investiaatinq the fea- 
sibility of a full -frame samplinq strategy to provide an 
alternate, yet more efficient estimator, based on a 
sampling unit size which is as small as one oixel. 

ACTIONS IN TASK 17; 

¥ 

Evaluate all procedures recommended by the LARS study. 
Appraise and recommend future development and implemen- 
tation of such strategies. (This is a task-defining 
task.) 

TIME: 

8 weeks 

MANPOWER: 

12 man-weeks 

ACTIONS IN TASK 18: 

Develop an alternative full-frame sample design. 

TIME; 

18 weeks 

MANPOWER: 

18 man-weeks 

% 

♦ 
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TASKS 19. 20. SOFTWARE CONVERSION 

AND ?.U 

BACKGROUND: Cufre,itly, most of the satnpl1n<i and apqreqation software 

and data bases developed are on the PDF 11/45 comouter 
system. The following tasks will be completed to con- 
solidate the sampling and aggregation capabilities on 
the LARS system at Purdue University and on the ERSYS 
when it becomes available. 

ACTIONS IN TASKS 19. 

20, AND 21: Transfer the LACIE and TY sampling and aggregation soft- 

wares from the POP 11/45 to the LARS system with control 
configuratior.s specified and procedure requirements doc- 
umented. Transfer the POP 11/45 data to the LARS sys- 
tems for all supporting USGP and foreign countries data 
bases consisting of agricultural statistics and CAMS 
segment estimates. Maintain a record of all data files 
thus created on the LARS system. 

Conduct verification tests for the transferred soft- 
wares. These tests are to be maintained on the LARS 
system until the' in-house ERSYS becomes available. Then 
similar conversion will be required for getting all 
software and data bases implemented in-house. 

TIME: 4 weeks for data 

5 we 'ks for aggregation 
13 weeks for development 
4 weeks for multi crop allocation 

4 weeks for within-stratum variance 

MANPOWER: 4 man-weeks for data 

5 man-weeks for aggregation TY 

13 man-weeks for the development of simulation of 
AgRISTARS crop production 
4 man-weeks for multicrop allocation 
4 man-weeks for within-stratum variance 
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TASKS 22 AND 23: 
ACTIONS IN TASK 


TIME: 

MANPOWER: 
ACTIONS IN TASK 

TIME: 

MANPOWER: 


ASSEMBLAGE OF DATA BASES 

22: Prepare a data base contaiPlng CAMS segment estimates 

for early season, midseason, and late season as well 
as the blind-site true crop proportions from LAC IE 
Phases II and III. 

6 weeks 

6 man-weeks 

23; Assemble and file all CAS reports for LAC IE Phases I, 
II, and III and TY. 

4 weeks 

2 man-weeks 



TASK 24; 

ACTION IN TASK 24: 


DOCUMENTATION 


TIME: 

MANPOWER: 


The procedure requi refnents for the sampllnq and 
agqreqatlon software will be specified and documented 
(1978-80), 

12 weeks 

18 man-weeks 


APPENDIX B 


APPLIED TASKS: SUMMARIES OF CROP TYPES, BACKGROUND, TIME, 
AND MANPOWER IN VARIOUS COUNTRIES 



TASK TITLC.! U.S, CORN/SOYBEAN PILOT AGGREGATIONS FOR AREA AND PRODUCTION 

TA SK 26 ; Prepare aggregation data base 

SCHEDULE AND TIME; December 15, 1980 - March 1. 1981; 10 weeks 

MANPOWER; 10 man-weeks 

TASK 27 ; Prepare for operational implementation 

SCHEDULE AND TIME; May 1 - June 15. 1981; 6 weeks 
MANPOWER; 12 man-weeks 

TASK 28 ; Perform pilot segment allocation 

SCHEDULE AND TIME; November 1, 1979 - February 1, 1980; 13 weeks 

MANPOWER; 15.6 man-weeks 

TASK 29 ; Aggregate area and production 

SCHEDULE AND TIME; June 15 - August 1. 1981; 6 weeks 
MANPOWER; 6 man-weeks 

TASK 30 ; Prepare report for accuracy assessment 

SCHEDULE AND TIME: August 1 - October 1. 1981; 8 weeks 

MANPOWER: 12 man-weeks 

TASK 31 ; Refine baseline procedures 

SCHEDULE AND TIME: July 1 - October 1, 1980; 13 weeks 
MANPOWER; 13 man-weeks 

TASK 32 ; Refine design procedures 

SCHEDULE AND TIME; October 15, 1980 - January 1, 1981; 10 weeks 

MANPOWER; 4 man-weeks 

T ASK 33 ; Demonstrate multiyear model 

SCHEDULE AND TIME: January 1 - April 1, 1981; 13 weeks 
MANPOWER: 13 man-weeks 
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TASK 34 ; Demonstrate partial response model 

SCHEDULF AND TIME? February 1 - May 1, 1981ji 13 weeks 
MANPOWER: 13 man-weeks 

TASK 35 ; Demonstrate other procedures 

SCHEDULE AND TIME: February 1 - May I, 1981; 13 weeks 
MANPOWER; 6.5 man- weeks 

BACKGROUND; These tasks will be implemented after the Improvement is 

made to the multi crof^ sampling and aggregation procedures 
for U.S. corn and soybeans. The development, testing, and 
evaluation procedures were improved by using actual and 
simulated data from selected areas in the United States 
that approximate the corn and soybean data for Brazil and 
Argentina. 
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TASK TITLE: US. /CANADA WHEAT AND BARLEY PILOT AGGREGATIONS 

FOR WEA AND PRODUCTION 


TASK 37: 

Refine baseline procedures 

SCHEDULE AND TIME: 

1 - October 1, 1980; 13 weeks 

MANPOWER; 

13 man-weeks 

TASK 38: 

Refine design procedures 

SCHEDULE AND TIME: 

November 1 - December 15, 1980; 6 weeks 

MANPOWER: 

3 man-weeks 

TASK 39: 

Demonstrate multiyear model 

SCHEDULE AND TIME: 

January 1 - April 1, 1981; 13 weeks 

MANPOWER: 

6.5 man-weeks 

TASK 40: 

Demonstrate partial response model 

SCHEDULE AND TIME: 

February 1 - April 15, 1981; 10 weeks 

MANPOWER: 

6 man-weeks 

TASK 41: 

Demonstrate other procedures 

SCHEDULE AND TIME: 

January 1 - April 15, 1981; 15 weeks 

MANPOWER: 

7.5 man-weeks 

TASK 42: 

Prepare aggregation data base 

SCHEDULE AND TIME: 

September 15 - November 1, 1980, and December 1, 1980 
January 15, 1981; 12 weeks 

MANPOWER ; 

12 man- weeks 

TASK 43: 

Prepare for operational implementation 

SCHEDULE AND TIME: 

April 15 - June 1, 1981; 6 weeks 

MANPOWER: 

12 man-weeks 


TASK 44 ; Check and adjust allocation 

SCHEDULE AND TIME; January I - March X, 1980 j 8 weeks 
MANPOWER; 8 fnan-weeks 

TAS:; 45; Aggregate area and production 

SCHEDULE AND TIME; June 1 - July 15, 1981; 6 weeks 
MANPOWER; 6 man-weeks 

TASK 46 ; Prepare report for accuracy asce^rsment 

SCHEDULE AND TIME; July 15 - September 15. 1981; 8 weeks 
MANPOWER; 12 .iwn-weeks 

BACKGROUND; These tasks will be Implemented after the overall designed 

system is exercised by combining the component technol- 
ogies and the improved baseline multi crop sampling and 
aggregation procedures. These procedures were applied 
with actual and simulated data from selected areas in the 
“United States that approximate the data for U.S. /Canada 
spring wheat and barley. 
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T ASK TITLE; U« 
TASK 48 ; 

SCHEDULE AND TIME; 
MANPOWER: 

TASK 49: 

SCHEDULE AND TIME: 
MANPOWER: 

TASK 50 ; 

SCHEDULE AND TIME: 
MANPOWER: 

TASK 51 ; 

SCHEDULE AND TIME: 
MANPOWER: 

TASK 52 ; 

SCHEDULE AND TIME: 
MANPOWER: 

TASK 53: 

schedule and TIME: 
MANPOWER: 

TASK 54 : 

SCHEDULE AND TIME: 
MANPOWER: 


S.S>R. BARLEY SAMPLING SUPPORT TO EXPLORATORY EXPERIMENT 
Design adaptation 

December 10, 1980 - February I, 1981*, 6 weeks 
3 man-weeks 

Test data base 

March 15 - May 1, 1981; 6 weeks 
6 man- weeks 

Design a sampling scheme 
March 1 - April 1, 1980; 4 weeks 
2 man- weeks 

Oemonstrat?' multiyear model 
January 1 - April 1, 1981; 13 weeks 

6.5 man-weeks 

Demonstrate partial response model 
February 1 - May 1, 1981; 13 weeks 

6.5 man-weeks 

Demonstrate grouping logic 
January 1 - May 1, 1981; 17 weeks 

8.5 man-weeks 

Demonstrate other procedures 
April 1 - August 1, 1981; 17 weeks 
17 man-weeks 


TASK 55 ; Select segments 

SCHEOULE AND TIME: April 1 - June I, 1980; 8 v«eeks 

MANPOWER ; 4 man-weeks 

TASK 56 ; Define Foreign Similarity Region (FSR) segments 

SCHEDULE AND TIME: October 15 - December 1, 1980; 6 weeks 
MANPOWER : 3 man-weeks 

TASK 57 : Prepare report 

SCHEDULE AND TIME; August 1 - October 1, 1981; 8 weeks 

MANPOWER; 12 man-weekr^ 

BACKGROUND: These tasks require employing the improved sampling 

scheme. 



TASK TITLE: U.S.S.R. BARLEY PILOT EXPERIMENT AGGREGATIONS 
FOR AREA AND PRODUCTION 

TASK S9 ; Refine deijign procedures 

SCHEDULE AND TIME: May 13 - August I, 1981; 6 weeks 
MANPOWER: 3 man-weeks 

TASK 60 : Refine baseline procedures 

SCHEDULE AND TIME: September 1. 1981 - Ji'nuary 1. 1982; 17 weeks 

MANPOWER: 17 man-weeks 

TASK 61 ;» Prepare aggregation data base 

SCHEDULE AND TIME: September 15 - November 1, 1981; 6 weeks 
MANPOWER: 6 man-weeks 

TASK 62 : Perform pilot segment allocation 

SCHEDULE AND TIME: December 15, 1980 - March 15, 1981; 13 weeks 
MANPOWER: 15.6 man^v.^^eks 

BACKGROUND: These tasks will be implemented after the improvements are 

made to the multicrop sampling and aggregation procedures 
for the U.S.S.R. barley. The development, testing, and 
evaluation procedures were improved by using actual and 
simulated data from selected areas in the United States 
that approAimate the wheat and barley data for the 
U.S.S.R. 
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TASK TITLE; BRAZIL CORN AND SOYBEAN SAMPLING SUPPORT TO EXPLORATORY EXPERIMENT 


Study sample frame 

December 1, 1980 - August 1, ISSl; 26 weeks 
20 man-weeks 

TASK 65 ; Design a sampling scheme 

SCHEDULE AND TIME; March 1 - March 22, 1980; 3 weeks 

MANPOWER; 0.6 man-week 

TASK 66 ; Select segments 

SCHEDULE AND TIME; March 22 - April 22, 1980; 4 weeks 

MANPOWER; 0.8 man-week 

TASK 67 ; Define FSR segments 

SCHEDULE AND TIME: October 15 - December 1, 1980; 6 weeks 

MANPOWER: 3 man-weeks 

TASK 68 : Locate segments 

SCHEDULE AND TIME; April 22 - June 1. 1980; 6 weeks 

MANPOWER: 1.2 man-i^eeks 

BACKGROUND; These tasks will implement the improved sampling scheme. 


TASK 64 ; 

SCHEDULE AND TIME: 
MANPOWER: 
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TASK TITLE; 


TASK 70 ; 

SCHEDULE AND TIME; 
MANPOWER: 

TASK 71 ; 

SCHEDULE AND TIME; 
MANPOWER: 

TASK 72 : 

SCHEDULE AND TIME: 
MANPOWER: 

TASK 73 : 

SCHEDULE AND TIME; 
MANPOWER: 

BACKGROUND: 


3ENTINA CORN/SOYBEAN EXPLORATORY EXPERIMENT AGGREGATIONS 
FUR AREA AND PRODUCTION 


Refine baseline procedures; design adaptation 
March 15 - May 1, 1981; 6 weeks 
3 man-weeks 

Procedures adaptation 

August 1 - November 1, 1981; 13 weeks 

13 man-weeks 

Test data base 

June 15 - August 1, 1981; 6 weeks 
3 man- weeks 

Define FSR segments 

October 15 - December 1, 1980; 6 weeks 
3 man-weeks 

These tasks will be implemented after improvements are 
made to the multicrop sampling and aggregation procedures 
for the Argentina corn and soybeans. The development, 
testing, and evaluation procedures were Improved by using 
actual and simulated data from selected areas in the 
United States that approximate corn and soybean data for 
Argentina. 
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TASK TITLE: ARGENTINA WHEAT SAMPLING SUPPORT TO EXPLORATORY EXPERIMENT 


TASK 75 ; Design adaptation 

SCHEDULE AND TIME; March 15 - May 1. 1981; 6 weeks 

MANPOWER: 3 man-weeks 

TASK 76 ; Define FSR segments 

SCHEDULE AND TIME; October 15 - December 1, 1980; 6 weeks 

MANPOWER: 3 man-weeks 

BACKGROUND; These tasks will result in the implementation of the 

improved sampling scheme. 
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TASK TITLE; 


TASK 78 ; 

SCHEDULE AND TIME; 
MANPOWER: 

TASK 79 ; 

SCHEDULE AND TIME: 
MANPOWER: 

TASK 80 : 

SCHEDULE AND TIME: 
MANPa^ER ; 

TASK 81 ; 

SCHEDULE AND TIME: 
MANPOWER : 

TASK 82 : 

SCHEDULE AND TIME; 
MANPOWER : 


AUSTRALIA WHEAT EXPLORATORY EXPERIMENT AGGREGATIONS 
FOR AREA AND PRODUCTION 


Refine baseline procedures; design adaptation 
March 15 - May 1, 1981; 6 weeks 
3 man-weeks 

Procedures adaptation 

September 1 - December 1, 1981; 13 weeks 

13 maivweeks 

Test data base 

August 1 - September 15, 1981; 6 weeks 
6 man-weeks 

Define indicator region segments 
October 1 - November 1, 1980; 4 weeks 
4.8 man-weeks 

Define FSR segments 

October 15 - December 1, 1980; 6 weeks 
6 man-weeks 
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APPENDIX C 


PROJECTED SCHEDULES OF TASKS ACCORDING TO 
PRIORITY, TIME ALLOCATION, AND MANPOWER NEEDS 
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Define pilot dasificafion approach 




ORIGINAL PAGE IS 
OF POOR QUALITY 



C-10 


*lndicatts the task dots not Moog to the Sampling and Aggragation grawp. 






















APPENDIX D 
DATA REQUIREMENTS 



original pwe J* 


TABLE 0-1,- DATA REQUIREMENTS 


t 


r 



A - The 1972-76 historical crop states: Illinois, Indiana, and Iowa 

B - Wheat and barley in North Dakota and Minnesota 
C - Corn and soybeans in Illinois, Indiana, and Iowa 
D - The 1978 crop acreage and proportion estimates (USOA/ESCS) 

E - Corn and soybeans in Brazil 
F - Wheat and barley in U.S.S.R. 

G - The CAMS estimates 
H - LAC IE and TV software, PDP-11/45 
I - Softvwre design and implementation 
J - Initial within-stratum variance estimates 
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K - Sampling frame, USDA/ESCS 
L - Theoretical research 
M - Monte Carlo technique 
N - Documentation and reports 
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